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' In this work, nonparametric log-rank-type statistical tests are in- 

troduced in order to verify homogeneity of purely discrete variables 
subject to arbitrary right-censoring for infinitely many categories. 
I ^ | In particular, the Cramer-von Mises test statistics for discrete mod- 

els under censoring is established. In order to introduce the test, 
we develop the weighted log-rank statistics in a general multivariate 
discrete setup which complements previous fundamental results of 
l Gill [13] and Andersen et al. [5]. Due to the presence of persistent 

^-H ' jumps over the unbounded set of categories, the asymptotic distri- 

. bution of the test is not distribution-free. The statistical test for a 

• large class of weighted processes is described as a weighted series 

'-pi | of independent chi-squared variables whose weights can be consis- 

■ tently estimated and the associated limiting covariance operator can 
£h be infinite-dimensional. The test is consistent to any alternative hy- 

pothesis and, in particular, it allows us to deal with crossing hazard 
functions. We also provide a simulation study in order to illustrate 
the theoretical results. 
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1. Introduction. Discrete data analysis has great importance in sev- 
eral fields such as economics, biology, medicine, etc. Typically, discrete time 
data may occur either because the underlying data generating process is 
intrinsically discrete one or because they are discretely recorded. In most 
situations, statistical procedures based on continuous or mixed distributions 
cannot be directly applied to the purely discrete cases due to the lack of a 
continuum amount of information needed to validate the asymptotic results. 

Frequently, one typically faces the problem of performing a data analysis 
along a time horizon subject to censoring, i.e., when some data at hand 
have occurred only within certain periods of time. To be more specifically, a 
general right-censoring scheme can be described as follows. Let Wf, . . . , Wn p 
be independent positive random variables representing the survival times or 
times to some events of n p items in a population p = 1, . . . , J. The corre- 
sponding distribution function and intensity function of the p-th population 
are denoted by F' p and h p , respectively. A typical situation occurs when 
l are censored on the right by independent positive random vari- 
ables {Cm}m=i- These censored variables C m are also assumed independent 
w.r.t W m - Thus, in this general random censorship model one can only ob- 
serve 

X m = min{W* , C m }, S m = =w * } 

where 5 m indicates whether W m is censored or not. 

In a large number of applications it is of major importance to test the 
homogeneity of populations in the presence of censoring. A major problem 
in the literature is the development of nonparametric methods to test the 
null hypothesis 

H :F 1 (£) = --- = F J (£); I E X, 

where X is the domain which encodes e.g the lifetime in a typical problem 
in survival analysis. The test should be consistent to a large class of alterna- 
tive hypotheses. A lot of different test procedures have been proposed and 
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studied so far [see e.g [20] and other references therein]. One of the most 
important nonparametric methods to verify Hq under random censorship is 
the well-known weighted log-rank test successfully developed in the setup 
of the Martingale theory proposed by Aalen [1] in survival analysis. See 
e.g the works [[2], [3], [6], [21] [13], [12]] and other references therein for this 
approach. The weighted log-rank test is one of the pillars of modern sur- 
vival analysis. In fact, it is the most commonly used nonparametric test to 
compare two or more continuous populations with data that are subject to 
censoring. In spite of the large flexibility of the log-rank test, it may not ex- 
hibit good power to deal with non-proportional intensity functions. This fact 
is a major drawback of the log-rank theory and it is discussed in several ap- 
plications with continuous distributions. See e.g Klein and Moeschberger [21] 
for a detailed discussion on this matter. 

While there is a large number of works about nonparametric methods 
for continuous distributions subject to censoring [see e.g [12], [6] and [3] 
and other references therein], it is rather surprising that only few works 
investigate methods for lifetime discrete data. See e.g. the works [16], [15] 
and [19]. The study of nonparametric tests for Hq and consistent w.r.t any 
alternative hypotheses for purely discrete distributions with full support 
under censoring is clearly rather important in many applications. This is 
the program we start to carry out in this work. We tackle this problem by 
naturally considering the weighted linear log-rank statistics given by 



LR q (n* 



n*,£) h n «(£)-h n «i(l) 



n 



£ N , r > 1, 



where £7 n * is a suitable empirical weighted process, h Up is the Kaplan- 
Meier estimator for the intensity function h p and the variable r encodes 
the number of categories which lives in an unbounded subset of the set of 
natural numbers N. A large amount of attention in this article is devoted to 
the study of the asymptotic behavior of the following statistics 



oo J— 1 

(1.1) CVM(n*) :=J2Y, LR l( n ^ r)4> 2 q (n*,r), n* G N J , 

r=l q=l 

for suitable choices of weights cf> q and Un% . The statistics (1.1) is reminis- 
cent from the classical Cramer-von Mises statistics largely used to compare 
continuous distributions under censoring. In this case, standard arguments 
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based on Gaussian processes may be used to prove the correspondent asymp- 
totic limit. See e.g [28], [6] and [21] and other references therein. In the 
purely discrete case, the study of the asymptotic behavior of (1.1) is not 
trivial. In this case, there is no obvious (if any) underlying Gaussian process 
which describes the asymptotic behavior of (1.1). The reason is the occur- 
rence of infinitely many persistent jumps as the sample size goes to infinity 
which causes some difficulties in establishing Lindeberg-type conditions. 

Very general and fundamental asymptotic results for the log-rank statis- 
tics was established in the eighties by Gill [13] and Andersen et al [5]. The 
key point of their asymptotic argument is based on the assumption that 
the size of the jumps of the underlying martingales goes to zero due to a 
Lindeberg-type condition. This of course does not hold when we are in a gen- 
eral discrete setup. In order to recover the discrete case, one needs to accom- 
modate the relevant information on each category of the variable of interest. 
See e.g Murphy [24] and Lipster and Shiryaev [22] for a detailed discussion 
on this matter. To our best knowledge only few works have tried to tackle 
this problem. Gill [13] proposed to spread the jumps at each category of the 
variable of interest over a neighborhood. His argument provides amenable 
time-changed processes whose their persistent jumps could in principle be 
asymptotically controlled under suitable conditions. See remarks after The- 
orem 4.2.1 in Gill [13]. However, his approach is far from being simple and 
it is not clear how one would apply his argument to a concrete discrete case 
in applications. 

Along the lines developed by Gill [13] and Andersen et al. [5], other 
works [see e.g [18] and other references therein] study asymptotic properties 
of different classes of tests for Hq but always restricted to continuous distri- 
butions. Another related work is Murphy [24] who proved a central limit the- 
orem in a rather general discrete setup but the resulting weighted processes 
do not recover the log-rank statistics. Stute [26] and Akritas [4] provide cen- 
tral limit theorems under censorship with distributions with atoms. They 
essentially study limit theorems for integrals w.r.t Kaplan-Meier estimators 
but they are not able to recover the log-rank setup. 

In view of the remarks raised above, we derive one asymptotic result 
in the setup of the discrete log-rank theory towards applications to the 
statistics (1.1). Our approach is rather different from Gill [13]. We introduce 
suitable underlying discrete-time nitrations (over the samples) which allow 
us to get rid the unavoidable persistent asymptotic jumps of the underlying 
martingales. See Remark 3.1 for more details. With these nitrations at hand, 
our first result (Theorem 3.1) applied to the log-rank statistics is solely based 
on soft arguments and well-known machinery from Martingale theory. 
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With the weighted log-rank asymptotic result at hand, we have prepared 
the basis for the asymptotic distribution (Theorem (5.1)) of the Cramer- 
von Mises statistics in (1.1) for independent discrete populations under 
arbitrary right-censoring and with infinitely many categories. In particu- 
lar, we provide a statistical test for Hq which is consistent to any alter- 
native hypothesis. Our test recovers previous works on discrete versions of 
Cramer-von Mises [23, 8, 27] for goodness of fit, without censoring and 
finite number of categories when applied to suitable choices of weighted 
processes. Moreover, the proposed test statistics supports a large class of 
empirical weighted processes which includes Fleming-Harrington [12] and 
Tarone-Ware [29] weights. 

Contrary to the classical continuous case (see e.g [28]), the asymptotic 
distribution of the Cramer-von Mises statistic is not distribution-free be- 
cause of the inherent nature of the jumps. Nevertheless, the underlying dis- 
crete structure allows us to write it as a weighted series of independent 
chi-squared variables with one degree of freedom where the weights can be 
consistently estimated from the data even when the limiting covariance op- 
erator is infinite-dimensional. 

In order to illustrate the importance of our results to data analysis, we 
briefly compare classical nonparametric tests based on continuous distribu- 
tions with the Cramer-von Mises test of this paper in a discretely recorded 
data set presenting crossing hazard functions. We show that the methodol- 
ogy developed in this paper allows us to detect crossing hazard functions 
when classical methodologies based on continuous distributions fail. 

In order to deal with infinitely many categories, the limit theorems (see 
Theorems 3.2 and 5.1) which describe the asymptotic behavior of (1.1) has 
to be worked out in the Hilbert space of square summable real sequences. 
In particular, Theorems 3.2 and 5.1 can be related to the work of Dedecker 
and Merleved [11]. In this work, they give necessary and sufficient conditions 
for a general stationary sequence of Hilbert space valued-random variables 
to satisfy the conditional central limit theorem. In particular they apply 
their results to characterize the asymptotic distribution of the Cramer-von 
Mises statistics essentially involving the empirical distribution function for 
continuous and/or discrete random variables. Unfortunately, their results 
do not recover censored random variables. Our central limit theorem com- 
plements [11] in the particular case of discrete variables arising in a right- 
censorship model where the empirical distribution function is not available 
for analysis. 

The remainder of this paper is organized in the following way. After fixing 
the notation and recalling some basics of the inference of the multiplicative 
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intensity model in Section 2, we formulate and state the main asymptotic 
results of this article in Sections 3, 4 and 5. Section 6 reports a simulation 
study and the proofs of Theorems 3.1 and 3.2, Proposition 3.1 and Theo- 
rem 5.1 are reported in Sections 7, 8, 9 and 10, respectively. The Appendix 
contains some technical results used in Theorem 3.1. 

2. Inference for the Multiplicative Intensity Model for Discrete 
Random Variables. In this section, we introduce the basic notation and 
the discrete model used in this paper. Let N be the set of non-negative in- 
tegers and let W and C be two independent discrete N-valued random vari- 
ables. The discrete random variable W describes the event of interest, while 
the discrete random variable C denotes the censoring variable. The reader 
may think C as the random variable which describes the censoring in a given 
statistical problem. Let us hx an underlying probability space (£l,J~, IP). 
Throughout this paper, we make use of the following notation: if Z and Y 
are two adapted processes, we write (Y.Z)(i) := Y(0)Z(0)+Y?e=i Y(£)AZ(£) 
for i > 1, where AY(£) = Y{£) — Y{£ — 1). In order to distinguish discrete- 
time stochastic integrals from simple Riemman sums, if N is a predictable 
process and M is a martingale, we shall write the correspondent discrete- 
time stochastic integral as J*N(£)dM(£) = (N.M)(i). Moreover, if M is 
discrete-time martingale then (M) denotes the usual predictable bracket. 

We fix once and for all, a natural number J > 1 and we write J = 
{1, . . . , J}. In the remainder of this paper, we denote n* := {n\, . . . ,nj) G N J 
and Nj := maxjni, . . . , nj}. Here n q denotes the size of the random sample 
at hand correspondent to the population q G J . The symbols V and A will 
denote the maximum and minimum between real numbers, respectively. Let 
us now describe the general right-censorship discrete random model of this 
work. For a given p G J, let (W p , C p ) be a given population where W p and 
C p are independent discrete random variables which must be interpreted as 
W and C, respectively. Let X p be a discrete random variable defined by 

X p := W p A C p , p G J. 

For any p G J , we take independent random samples { (Wf , Cf) , ■ ■ ■ , (Wn p , Cn p )} 
from the population (W P ,C P ) where 1 < n p < oo. To shorten notation, 
throughout this paper we assume that ¥[C P = 0] = PfW^ = 0] = for any 
p G J . With these random samples at hand, we introduce 

and the counting processes 
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for m = l,...,n p and i > 0. The counting processes associated with the 
p-th random sample are given by 

lip Tip 

(2.1) R^(i):=J2RP m (i) and R c > n *{i) := £ ^(i), i > 0. 

m=l m=l 

In order to take into account all the information generated by the random 
sample at hand, we define 

Up 

jr n p ._ \/ /IP 
J i V m,*' 

m=l 

where 

^ :=a(Ai^^),Ai^(i);l<i<i), 

for each i > 1 and m = 1, . . . ,n p . We also set J 7 ^ := A p mQ := {0,^} for 
m = 1, . . . , n p and p ^ J . Here, for a given family of random variables D 
the class ct(D) is the smallest sigma-algebra making all of D measurable. 
Moreover, \J is the smallest sigma-algebra generated by a union of sigma- 
algebras. 

In this paper, several types of nitrations will play different rules. The 
filtration 

will be used to perform the Doob-Meyer decomposition for the counting 
process R np as follows. By the very definition, for each m G {1, . . . , n p } and 
p £ J the Doob-Meyer decomposition of R p m w.r.t A„ := {A P ni ,i > 0} 
reads 

R p m (i)=Y*(i)+NV l (i), i>\ 

where Nm is the compensator of Rm and Ym is the associated martingale. 
Here i&(0) = 1^,(0) = JV^(0) = a.s. The definition of the filtration W n ? 
yields the following Doob-Meyer decomposition 

n v 

R"p(i) = J2 (YP(i) + N%(tj) =: Y n ?(i) + N n *(i), i > 1. 

m=l 
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By the very definition, the following F np -decomposition holds 

i 

(2.2) R n r(i) = Y n "(i) + J2v np (£)^H p (£) i > 1, 

i=i 

where 

^(•) := X>™(-) and ff P 0):=E^')- 

m=l j=l 

Here h p is the intensity function associated to the random variable of interest 
W p which can be written 



provided that ¥[W P > j] > 0. 

Remark 2.1. 5y construction, one should notice that each Am-martingale 
Ym is also a martingale w.r.t ¥ Up for every m = l,...,n p . Moreover, the 
martingales Ym and Yj are independent for any m ^ j in {1, . . . , n p }. As a 
consequence, the following representation for the predictable bracket holds 

i 

(Y n ")(i) = ynp (t)h p (£) [1 - h p (£)} , i > 1. 
e=i 

In the remainder of this paper, we denote 



= F[X p >£]; p£j,£>0, 

and whenever necessary, we can always assume that for each q G J there 
exists a category i such that q (i) > 0. This ensures for instance that we 
can write h q as in (2.3) on {1,... One can easily check the following 
elementary property. 

Remark 2.2. V np (£) has binomial distribution with parameters n p and 
6 P (£) for each £>1. The conditional distribution ofAR n p(£) given V np (£) = 
j is binomial with parameters j and h p (£), for every £ > 1 and j = 1, . . . , n p . 
Moreover, 

(2.4) E[AR n p(i) | V n " (*)] = V n "(i)h p (i) = E[AR n "(i) \ F> x ], i > 1. 



ON THE DISCRETE CRAMER- VON MISES STATISTICS 



9 



We are now in position to derive an empirical estimator for the intensity 
function h p . The idea is fully based on the relations (2.2) and (2.4). The 
martingale component Y Up is interpreted as a noise and the predictable 
component N np contains all the information about the law of the discrete 
random variable W p needed for the estimation. Therefore, it is natural to 
introduce an estimator for h p based on the condition that Y np = 0. In fact, 
if Y np = 0, we recover the Kaplan-Meier estimator as follows 

AR Up (i) 

(2-5) h n *(i) = ^-±Ll {vnp{{)>G y, i>i. 

Based on (2.5), we then define the following estimators for H p and for the 
law 7r p of the discrete random variable W p , respectively, as follows 



(2.6) H n *{i) :=J2h np (i), 

i=\ 

i-l 

(2.7) tt"' p := h"* (i) JJ [1 - h Hp {£)} . 

i=i 

for i > 1. The Doob-Meyer decomposition (2.2) and (2.5) yield 

(2.8) - flf (i) = j % ^—\ {vnp{l)>Q} dY n P(t) + r np (i); i > 1, 

where the predictable component r np (i) := Ylt=i h p (ty[^{V™p (e)>o} ~ 1] van- 
ishes as n p — > oo due to a Borel-Cantelli argument provided 9 p (i) > 0. The 
following useful remark gives the asymptotics of the above estimators. These 
results can be easily proved by routine arguments so we omit the details. 

Remark 2.3. For a given p £ J , let i be a positive integer such that 
9 p (i) > 0. From identity (2.8) we may conclude that H Up — > H p uniformly 
(over {1, . . . in probability as n p — > oo. Moreover, h p (l) — > h p (£) and 
7r" p — > 7rJ in probability as n v — > oo, for each I € {1, . . . , i}. If p (i) > for 
every i>l, then sup^> x \ h np (£) — h p (£)\ — > in probability as n p — »■ oo. 
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3. Asymptotic Distribution for Discrete Stochastic Integrals. 

In this section, we provide the asymptotic results which will be the ba- 
sis for the statistical tests in this article. From (2.8) and Remark 2.3, we 
know that the accumulated intensity process H p admits a natural class of 
consistent estimators H np in such way that H np — H p is a discrete-time 
F np -semimartingale of the form (2.8). Therefore, a natural strategy will be 
based on a martingale central limit theorem. In the previous section, we have 
denned the filtration family {¥ np ;p € J, n p > 1} where the Doob- Meyer de- 
composition and the resulting Kaplan-Meier estimator were performed. In 
the sequel, in order to obtain the asymptotic distribution, we are forced to 
use different types of nitrations. Let 

J 

R n \i) := J2R nk (i); i > 0, 
k=l 

be the total number of events of interest at category i and let 

J 

V n \i) :=J2V nk (i); i>0, 
k=i 

be the total number at risk at category i. In order to keep track the limiting 
martingale behavior at different samples, we introduce the filtration F = 
{J~f,i > 0} generated by the whole information available at each category 
as follows 

In the sequel, to shorten notation we introduce 

VHj) := {vr(j),...,V?(j)), KUJ) := {^(j), ■ ■ ■ , R p m U)) , 

and 

K%*>(j) := (i?f' p (j), . . . , Rg*(j)), l<m<Nj;j>l,peJ. 

Let us now introduce another filtration which will be the basis for our 
asymptotic results. This filtration family is carefully chosen as follows. For 
a given category j > 1 and n* £ N J , we define the filtration Q n (j) := 
{Gm (j); < < A^j} along the samples as follows 
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Qtd) ■■= a{(^0-),A^(i-l),V^ +1 (j),A^ m (i),A^(j)), P G J], 
for any < m < Nj — 1 and 

Here we set AiK(O) = i? n *(0) and A7£g = A7^' p = 0. 

Remark 3.1. The building block for the asymptotic results of this article 
is based on a martingale structure over the filtration Q n (j) along the samples 
for a given category j > 1. In fact, one can readily see that for every p G J , 
n* G N" 7 and j > 1, Y p (j) is a Q n * (j) -martingale array difference. Moreover, 
for every p G J and n* G N J ; R np (j) has the same Doob-Meyer decomposi- 
tion w.r.t. Q n (j) (over the samples) and¥ np (over the categories). In order 
to recover the classical Tarone- Ware and Harrington- Fleming weighted pro- 
cesses, we include the term AR n (j — 1) in the definition of the filtration. 
This structure allows us to accommodate the persistent jumps at each cat- 
egory as the sample size goes to infinity. This strategy is rather different 
from Gill [13] who used a time-changed argument on the level of categories 
in order to deal with the jumps of partially discrete distributions. Gill's idea 
is to spread the jump of R np {j) at category j over a time interval which is 
inserted at this category. 

In the remainder of this paper, we always consider the random variables 
1/V nq (€) as been multiplied by indicator functions TL{v n i(e)>o] for any q G J 
and l>\. 

3.1. A Martingale Central Limit Theorem. Let us now describe a list of 
the technical assumptions which will constitute the basis for the asymptotic 
results of this section. For any pair qi ^ q in J and n* = (n\, . . . , nj) G N J , 
we are going to write 

Un q n K> •) = {Un q qi («*, *); i > 1} 

as an F-predictable process satisfying some technical assumptions. In the 
remainder of this work, n* — > oo means n p — > oo for every p G J . 

(Ml) For each (n q ,n qi ) G N 2 and n* G N J , lQ (n*,i) is Qq (i)-measurable 
for every i > 1; 
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(M2) There exists 5 > such that 



lim n r 





2+5 


V n qi 




2+5 






V n n (i) 





in probability for each i > 1; 

(M3) For any 52 G {<?><?i} an d £>1 there exists a constant a q q m (€) such 
that 



l^ g >y)i 2 

in probability as n* + 00. 

(M3') For any q 2 £ {q,Qi} we have that 



> hm sup It, — — — — < 00 



and 



E 



|tC>V)| 2 



V n n (£) 



in probability as n* — > 00. 



(M4) For any q 2 £ J ((72 Q,Q2 7^ Qi) and £ > 1 there exists a constant 
Pqi ,92 CO such that 



U^ qi {n\£)U^ q2 {n\£) 



in probability as n* — > 00. 

(M4') For any q 2 £ J (<?2 Q,Q2 ^ qi) we have that 

KUnV)tfnl(nV)! 



lim sup E- 



and 



V n i(£) 
V n "(£) 



< 00 
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in probability as n* — > oo. 

When the number of categories is finite then (M3') and (M4') are not 
necessary. Since one of the main applications of the theoretical results of 
this paper lies in lifetime data analysis under censoring, then it is crucial 
to work under the setup of infinitely many categories. Assumptions (M3') 
and (M4') encode exactly this situation. 

In fact, we are going to show that a large class of weighted processes 
satisfy the above list of technical assumptions. See Section 3.2 for more 
details. Let us now define a family of random variables which encodes any 
hypothesis test related to the homogeneity of several discrete populations 
under censoring with infinitely many categories. For a given n* G and 
q G J i we define the following random variables 

(3.1) C; 9 W :=£tC>V) 

where m = 1, . . . , Nj and we set AY^ = if < m < Nj and k G J ' . 

A first simple remark is that 1 < m < Nj} is & martingale-array 

difference w.r.t the filtration Q n (I) (see Lemma 7.1) for each q G J', n* € 
and £ > 1. Thus, we shall apply usual arguments from martingale theory 
(see Lemma 7.2) to prove that the sequence Ylm=i £mq(ty converges weakly 
to a zero mean Gaussian distribution with variance <^ q {t) given by 

<%(i) ■.= Y.<^ qi m-h q ^{t)] + Y;<^) hq m-h q {t)] 

qi¥=q qi¥=g 

(3.2) + 2 y, &*(.w q (w-h q m-, i>i, 

(<?i,<?2)eAj 

where the family of functions aq] qi , aq tQl , fi\ xm : N — > R in (3.2) are given in 
(M3) and (M4) , respectively. In (3.2), we denote A q := {(x, y) G JxJ;x ^ 
y, x / q, y / q, 1 < x < y < J} for q G J. 

In Lemma 7.2, we also show that the asymptotic variance <fiq(£) can be 
consistently estimated by 



V n i(£) V n "i(£) 
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I 2 * 



PS? (n*,£)\ 2 ~ „ \UnUn\e)\ 2 , 



yn qi (£) 



=- E 

(3.3) 

+ 2 £ ^(nVg^n^ ^ 



(9i,92)eA 9 



for E N J and £ > 1. 

In the sequel, the analysis will be based on the following multi-dimensional 
process 

/ Nj Nj \ 

f*W := E CaW, • • • , E c» ; / > I- 

\m=l m=l / 

The multi-dimensional case requires additional assumptions on a given weighted 
process. Given r ^ k in J , let U be a weighted F-predictable process which 
satisfies the following assumptions: 

(HI) For any q\ ^ k and £ > 1, there exist constants jqf(£) and T]qf(£) 



such that 



[/^(nV^tnV) 



{£) 



U^(n*,£)U^(n\£) 



T] k g l r (t) 



V n k(£) 

in probability as n* — > oo. Moreover, U'nlJJnl is non-negative a.s for every 
q^ J with q ^ k and q ^ r. 

(HI') For any 7^ fe, we assume that 



00 

E 

1=1 



JK*>V)^;(nV 

hmsupE< 1 



V n "i {£) 



\U%(n*J)U%(n*J)\\ 

+ WH£) <oc 



and 



E 



U%* (n*,£)U%r(n*,£) . 
n 9l v n fc V ,_/ fcr(£) 



+E 



UV: k (n*,£)U% r (n*,£) , 

n qi \ 1 j n qi \ __ k,r/n\ 
'9i v*/ 



yn,i (£) 
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in probability as n* — > oo. 

Of course, the above assumptions (Hl-Hl') only make sense if J > 3. 
Without any loss of generality, throughout this section we assume that J > 
3. See Remark 3.3. In the sequel, if k ^ r in J then we set A(k, r) := {qi E 
J\ qi 7^ k, qi ^= r}, and we denote 

Mk,r,£) ■.= e ^ r Wh«m-h 9 m-Y,^ r w hk w i - hk w 

qieA(k,r) q\+k 

(3.4) - J2v£m r m-h r m, *>i, 

where the functions jgf, rjgf , rfqf '■ N — > R are defined via a weighted process 
satisfying assumptions (HI). Since Jqfi-) = Iqfi') for every k ^ r in J 
provided q\ E A(k, r) then the symmetrization in the second line of (3.4) 
yields ip(k, r, •) = ip(r, k, •). Moreover, for k ^ r we denote 

ip n *(k,r,£) 



(3.5) 

for n* E N J , £ > 1. A gain symmetrization in the second line of (3.5) yields 
■0 n *(fc, r, •) = ifj n *(r,k,-) a.s for every 7^ r in J and n* E N J . Now we 
are in position to state the first result of this section. In the sequel, we set 
r(0) = and we denote T(i) := Yl\=\ QiP)i * ^ 1> where Q is the self-adjoint 
operator defined by the following quadratic form 

J 

(3.6) (Q(^)a, a) K j = ^ 4<^ (£) + 2 ^ a r a k tf;(k, r, £); a E R J , £>1. 

k=l l<r<k<J 

Convergence stated in (11.18) and (11.24) ensure that the quadratic form (3.6) 
is actually non-negative. We also define the self-adjoint random operator 
Q(n*,£) induced by the quadratic form 



: = E 1 v^W — - h *m-h«(e)] 

qieA(k,r) 1 ' 

E " v Lt } h\£)[l-h\£)] 

E 92 v ! r{ 2 — -vm - h r a)], 

qn+r 



16 



(3.7) 

J 

{Q(n*,£)a,a) u j = J24Kn*( £ ) + 2 £ a r a k ^(k,r,£); a G R J , £ > 1. 

fe=l l<r<fc<J 

We set f(n*,0) := and f(n* i) := E/=i^( n *^); n * G nJ > * ^ L 

Since the variables of interest {W p ;p G assume values in an un- 
bounded set in a typical discrete lifetime data, it is important to introduce 
the following objects. Let {d l n „;n* G N J } and {d^«;n* G N J } be two se- 
quences of F-stopping times which satisfy the following hypotheses: 

(51) d l n » < d™* < oo a.s for every n* G N" 7 and there exists a pair (d l ,d u ) G 
N 2 such that 1 < d l < d u < oo, and 

d n * — > d l and d%* — >■ <i n in probability as n* — >• oo. 

(52) If d" < oo then 0P(d u ) > for every p G J. If d" = oo then 0P(») > 
for every i > 1 and p £ J. 

In the sequel, i>ec(^4) denotes the usual vectorization of an m x n matrix A, 
i.e., uec(^4) := [an, . . . ,a m i,ai 2 , • • • ,a m 2, • • • ,a>in, ■ ■ ■ ,a mn ] T . Here repre- 
sents the (i, j)-th element of a given matrix j4 and the superscript T denotes 
the transpose. 

Theorem 3.1. Assume that a weighted process U satisfies assumptions 
(Ml, M2, M3 1 , M4') and (Hi'). Then for every sequence of F-stopping 
times d l n * and d™* satisfying (S1-S2), we have 

(3.8) £ n *W -> N (o,T(d u ) - T(d l - 1)) weakly as n* -> oo. 

l = S x 

Moreover, 

(3.9) vec(t(n\d^) -f(n*,d! n * - 1)) -> wec^d") - r(d' - 1)) 
in probability as n* — > oo. 



ON THE DISCRETE CRAMER- VON MISES STATISTICS 



17 



Remark 3.2. Assume d u < oo and 9 p (d u ) > for every p G J and 
£mg(^) is square-integrable for every n* G N J ,q G ^7 and £ > 1. T/ien i/ie 
result stated in Theorem 3.1 holds under assumptions (M1-M2-M3-M4) 
and (HI). 

Remark 3.3. If we have just two populations (J = {1,2},) then the 
underlying covariance structure in Theorem 3.1 simplifies substantially since 
in this case 

J 

(Q(l)a,a) uJ = J24$lW, 

k=l 
J 

(Q(n\£)a,a) RJ = J>^(*), 

k=l 

for a G M. J , n* G N J and 1 < I < oo. 

In the sequel, we explore the whole trajectory of the B^-valued process 
£ n weighted by the sequence 4> Tl * := {(/>i >n *(i), ■ ■ ■ , 4>.J,n*{i) ;i > 1} in a 
suitable Hilbert space which encodes the quadratic powers of £ n . Doing so, 
our main motivation and application for the next result is the introduction 
of the Cramer-von Mises test statistics under arbitrary right censoring for 
infinitely many categories. Let us define the following M^-valued weighted 
random field 

{r Nj 
^WEECW; q = i,...,j 
e=i m=l 

for r > 1, x = {x q (i); q=l...,J, i > 1} and n* G N J . With this Revalued 
random field, for a given (n, m) G N 2 with 1 < n < m, x G M.°° and n* G N" 7 
we define 

(3.11) GET(n*, x, n, m) := (GLR(n*, x,n),..., GLR(n*, x, m)) . 

Of course, under mild assumptions on the weights (x, U™1 ) and the sample 
we can safely embed the process GET(n*, x, ■, ■) into the Hilbert space £ 2 (N) 
constituted of square-summable real sequences over N. In the sequel, we 
make use of the following notation: || • ||^2 stands for the usual norm on the 
Hilbert space £ 2 (N), M(£) := diag(^(£), . . .,<f>j(£)). 
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In the sequel, for a given 1 < s < i < oo we consider the self-adjoint 
operator Y(s,i) : M. Jk ( s,i ) — > M" 7 ^'*) defined by the following quadratic form 

k(s,i) 

(Y(s,i)a,a} RJk ( s ,i) := ^ {M(j + s - l)T(j + s - l)M(j + s - l)aj,aj) R j 

i=i 

+ (M(£ + s- l)T(£ + s- l)M(j + 8- l)a t ,aj) RJ 

^<£<j<k(s,i) 

+ Y, (M(£ + s-l)r(j + s-l)M{j + s-l)a e , aj ) R j, 

l<j<£<k(s,i) 

for a £ R^ fc ( s ' 1 ); k(s, i) := i — s+1. From the definition of the above quadratic 
form, we notice that for each s > 1 the restriction of Y(s,j) onto R^ fe ( s 'v 
is equal to Y(s,i) for every j > i. Therefore, for a given (s,m) £ N 2 with 
1 < s < m < oo we shall construct a linear map 

(3.12) y(s, m) : R°° -> R°° 

defined as follows. If m < oo, then we set y(s, m)a := Y(s, m)(ai, . . . , ajk(s,m)) 
so that 3^(s, m) = y(s, m). If m = oo, then for a given a E M°° the Jk(s, i)-ih 
coordinates of the action y(s,m)a is defined by 

(3.13) {y(s,i)(ai,...,a i7fc(3)i ))}; s < i < oo. 



Theorem 3.2. Assume that assumptions (Ml, M2, M3 , M4') and 
(HI') hold and let 1 < d l < d u < oo where (S1-S2) holds. Then the weak 
limit 

(3.14) Urn GET{n*,0,d l ,d u ) 

n*— ¥oo 

is a zero-mean Gaussian measure on £ 2 with covariance operator y(d l ,d u ) 
on I 2 defined by (3.12) and (3.13). In particular, 

oo J 

(3.15) \\GET(n*J n *,d! n *,d%*)\\ 2 e2 -> Y^52 X *A weakly as n* -> oo, 

s=l g=l 
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where {X sq ; s > 1, q = 1, . . . , J} are the eigenvalues ofy{d\d u ) and {xL> s — 
l,q = 1, . . . , J} is an iid subset of chi-squared random variables with one 
degree of freedom. 

The remainder of this paper is devoted to give applications of Theo- 
rems 3.1 and 3.2 to the analysis of purely discrete populations typically 
founded in a lifetime data setting. At first, we exhibit a large class of 
weighted process which satisfies the assumptions in Theorems 3.1 and 3.2. 

3.2. Weighted Processes. A large number of weighted processes satisfy 
(Ml, M2, M3', M4') and (HI'). For instance, they can be chosen accord- 
ing to the following generic class 

(3. 16) ^ („.,«=_ (i) ^(ir^cs),^!, 

for any pair q ^ q\ in J and n* = (m, . . . , nj) G N J where we set n := 
Yli=i n i- We assume that the weighted process u(n*, •) is bounded, it satisfies 
the measurability assumption (Ml) and it converges in probability to a 
bounded real-valued function uj. A similar class has previously appeared in 
Andersen et al [5] for the continuous case. In the sequel, we denote by 1C the 
class of all weighted process which can be represented by (3.16). 

Proposition 3.1. Let us assume the existence of the limit b p = lim n *_ s . 00 n 
and X p is integrable for every p G J . Then, every weighted process in the 
class KL satisfies assumptions (Ml, M2, M3 1 , M4' ) and (Hi'). 

A significant subclass of tC is given by the classical Tarrone-Ware [29] 
and Harrington-Fleming [17] weighted processes. The weighted functionals 
u(n*, •) are given, respectively, by 

where <p is a bounded continuous function and (3 and 5 are positive constants. 

4. The Log- Rank Statistics. In this section, inspired by the weighted 
log-rank statistics proposed by Gill [13], Fleming and Harrington [17] and 
Andersen et al [5], we propose a test in order to verify the homogeneity of 
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discrete populations in the presence of arbitrary right censoring. Our goal 
is to derive a class of statistical tests for the null hypothesis 

H : h 1 ^) = h 2 (£) = ■■■ = h J {£);£ E N. 

Throughout this section, all weighted processes belong to the class /C. In the 
sequel, we denote 



(4.1) d u = sup {£ : min 6 q {£) >0 , 



(4.2) dl* = sup U : imnV n «(£) > j , n* G N . 

One can easily check that d^* — > d u in probability as n* — > oo, where 
1 < d u < oo and d™* < oo a.s for every n* £ N J . If q € JT, then we introduce 
the following general linear J-sample statistics 



J 



, 1 \ 1/2 

i=i ^ ^ 



/i n *(£) - h n n {£) 

AR n "(£) AR n * 



V n t{£) V n *(£) 



for n* G N^, j > 1. We notice that LR q (n*,-) is the g-th component of 
£ n under the particular null hypothesis i^o- Following Theorem 3.1 and 
Proposition 3.1, under Hq the random vector 

LR(n\ := (LR^n*, (%*),..., LRj{n\ d u n ,)) T 

converges weakly to N(0,T(d u )) as n* — > oo, where T(d u ) admits a con- 
sistent estimator r(n*,d^t) given by the matrix induced by the quadratic 
form (3.7). Similar to the continuous case, the sum of the components of the 
random vector LR is null and hence we consider the vector LR without the 
last component as follows 



LRo(n*,d&,) := (LR^n*, c&)> . . . , LRj^(n\ d£*) T . 

In what follows, we denote Tq(i) := X^=i Qo(^) where Qq is the operator Q 
defined in (3.6) without the last row and column. We are now in position to 
define the weighted log-rank statistics associated to LRq as follows 
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(4.3) X 2 {n\dl*) := LR (n\ d u n *) T t „(n*, d^y 1 LR (n* , n* G N J , 

where To(n*,j) := Y2i=i Qo( n * ; ^)i j > 1- One can easily check that the 
statistics (4.3) is asymptotically chi-square distributed with J — 1 degrees 
of freedom, where Tq(ti*, <i^*) _1 is the ordinary inverse. 

Remark 4.1. Under Hq, we denote the intensity function by h p (£) = 
h(£) and OP {I) = 9(£) for each £ > 1 and p G J . It follows from (9.5) that 
the covariance component of V is negative. In fact, for each k / r in J 

ip(k, r,£) = -J oj 2 (£)b r b k 6(£)h(£)[l - h(£)], £ = d l ,...,d u . 

In this case, we can apply the same arguments of Andersen et al [5] to 
conclude that the rank(T{£)) = J — 1 for d l < £ < d u . By consistency of 
T(£), the probability that T(£) has rank J — 1 increases to unity as n* — > oo. 

Remark 4.2. The class of statistics X 2 (n*,-) indexed by weighted pro- 
cesses in the class 1C is rather general in the sense that it covers the classical 
cases of Tarone-Ware and Harrington- Fleming but it is not restricted to 
these cases. In fact, we have proved that one can choose any weighted pro- 
cess U which satisfies the assumptions in Theorem 3.1. Under these assump- 
tions, one can always define a correspondent weighted log-rank statistics of 
the form (4.3). 

Type 2 Censoring. In many practical applications, the censoring scheme is 
linked to a failure time process. For instance, let us suppose that the stopping 
time for a product life testing is not fixed a priori, i.e., it is not fixed before 
the beginning of the study but it is chosen later, with the choice influenced 
by the results of the study up to that time. The so-called Type 2 censoring 
refers to the case which based on the observed data on that moment, one 
may want to stop the experiment at some stopping time. Let us suppose the 
case where the experiment finishes at the occurrence time of the /3-quantile 
of the observed data (0 < (3 < 1). In order to develop a suitable model for 
this important type of censoring, we assume X p = W p and stop the weighted 
log-rank process LRq(ii* ', •) at the time of the occurrence of the /3-quantile. 
Let us consider the following stopping times 

4* := inf{£ > l;R n *{£)/n > /3}, 
£° := mf{£ : hF 1 ^) + ... + bjF J {£) > /?}, 
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where F q is the distribution function of W q , q € J . One can easily check 
that the following convergence holds 



in probability as n* — > oo. As a consequence of Theorem 3.1 and Proposi- 
tion 3.1, we arrive at the following result. 

Corollary 4.1. Under Hq the log-rank statistics LRo(n*,£ n *) con- 
verges weakly to N(0,T (£ )) as n* — )• oo, where To^ ) admits a consistent 
estimator To(n* ,£ n *). 

5. The Cramer- von Mises Statistics. The goal of this section is 
to propose a class of statistical tests for the null hypothesis Hq which is 
consistent to any alternative hypothesis. As a consequence of Theorem 3.2, 
we are able to introduce a Cramer-von Mises statistics for purely discrete 
populations under censoring as described in Section 2. In the remainder of 
this section, we encode the first observed categories by the following sequence 
of F-stopping times 

(5.1) d l n * := inf{^ : AR n * (I) > 0} 

and we assume that all weighted processes belong to the class fC. One can 
easily check that 

d l n * -+ d l 

in probability as n* — > oo where 

(5.2) d l :=mf{£:bih 1 (e) + --- + b J h J (£) > 0} 

for bp = lim n *^. 0O n p /n; p G J . Let us now introduce a version of the Cramer- 
von Mises statistics in order to test homogeneity of discrete populations in 
the presence of arbitrary right censoring with infinitely many categories. 
From Theorem 3.2, we only need to consider the particular case when the q- 
th component of £ n is the log-rank statistics with dimension J — 1. That is, 
GET(n* , (fr n * ,i, j) in (3.11) is composed by the following weighted random 
field 



(5.3) GLR(n\ n *,r) = {4> q>n *(r)LR g (n* , r); q = 1, . . . , J - 1} 
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for r > 1, n* € N J . We set M (£) := diag(0i (£),..., (f>j-i(£)) and the 
covariance operator in Theorem 3.2 (see (3.12)) is defined in the same way 
but Mq and Tq instead of M and T in the quadratic form. We denote this 
linear operator by yo{d l ,d u ) so that the (J — l)k(d!,i)-th coordinates of the 
action y^d 1 ,d u )a is {Yq{$ ,i)(a\, . . . ,Q<ij-\)kU l ,i))} f° r B ^ 2 an d k(d l ,i) = 
i - d l + l;i > d l . 

The natural candidate for the estimator of the operator yo(d l , d u ) : £ 2 — > 
£ 2 can be constructed in a natural way as follows. To shorten notation, we 
introduce the random set L(dJ,*,d£*) = {d l n * < £ < d£* : AR n * (£) > 0} 
of observable categories and we denote L(n*) its cardinality. For a given 
n* 6 and a € £ 2 , we define the action 

Md l n *,d u n *)a 

as the real sequence where the (J — l)L(n*)-th coordinates are given by 

Yo(d l n *,dn*)(ai . . . ,a(j_i)L( n *)) 

and Y (d n *, d^) is the self-adjoint random operator defined by the following 
quadratic form over R( J-1 ) L ( n *) 



(Y (d l n *,d%*)a,a) = £ {Mo(j)foO')Mo(i)%,ai)KJ-i 

{£<rAjeL(d l niI ,dZ*)} 

+ J2 (Mo^foOOMoO')^,^)^-!, 

{j<M,jeL(S nir ,d^)} 

where M Q := diag(0i, n *(-), • • • , <j>j-i, n *(-)) and a E R(- 7-1 ) I, ( n *). Therefore, 
2Po(^n*)^n*) : ^ 2 — ^ ^ 2 is a well-defined sequence of self-adjoint finite-rank 
random operators. 

In view of the log-rank composition in (5.3), we are now in position to 
introduce the Cramer-von Mises statistics associated to the general discrete 
censoring model described in Section 2 as follows 

WM(n*,4,4) := \\GET(n*J n *,d l n +,d^)\\%;n* G N J ~\ 

As a consequence of Theorem 3.2 and Proposition 3.1, we arrive at the 
following result. 
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Theorem 5.1. Assume that U belongs to the class JC, the growth condi- 
tion in Proposition 3.1 holds and let (d l , d u , d l n * , d£*) be the categories and 
the stopping times defined by (5.2), (4-1), (5-1) and (4-2), respectively. Then, 
under Hq 

oo J— 1 

(5.4) CVM(n\ d l n * , dfc ) -> E X "A weakl y « «* °°> 

s=l q=l 

where {A sg ; s > 1, q = 1, . . . , J — 1} are i/ie eigenvalues of the covariance 
operator y${d l ,d u ). In particular, if X q is square-integrable for every q £ J 
then 

L(n*)j-1 oo J-l 

(5.5) A(n*) := E E V&^Cn*) ~* S I] X *<iX% 

s=l q=l s=l g=l 

weakly as n* — > oo, where {X sq : 1 < s < L(n*), q = 1, . . . , J — 1} are t/ie 
random eigenvalues of the covariance operator estimator yo(d l n * , d™*) and 

A(n*) := (yo(d l n *,d%+) is non-negative^ , 
so that F(A(n*)) -)■ 1 as n* ^ oo. 

We notice from (5.5) that the P-value for the hypothesis test Hq is given 
by P[A(n*) > CV M (n* , d l n * , d^) \ H ]. The approximate law A(n*) is a 
weighted sum of independent chi-squared random variables and hence sev- 
eral algorithms to evaluate the P-value are available. See e.g Duchesne and 
Micheaux [10] for a recent discussion. 

6. Simulation. In this section, we perform a simple simulation study 
to evaluate the behavior of the classes of statistics proposed in this paper. 
We analyze the effect of the sample size, the proportion of censored data and 
the number of populations. Here, we assume the variable of interest follows 
a Poisson distribution with parameter X p for any p € J and the censoring 
variable also follow a Poisson distribution with parameter A c . In this section, 
all statistics are considered in terms of of the weight u(n* ,£) = 1 in the class 
K.. The goal is to test the hypothesis Hq : Ai = A2 = • • • = Aj which is 
equivalent to the hypothesis Hq : h 1 ^) = h 2 (£) = ■■■ = h J (i); l>\. The 
simulation is performed by means of the software R ([25]) and the P- value 
is evaluated via Davies algorithm ([9]). 
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In order to evaluate the convergence of the proposed statistics, we have 
sampled from several populations with Poisson distribution with Ai = 100 
and taking into account different sample sizes and censoring variables. It was 
generated 10000 samples for each sample size (SS), A c (without censoring, 90 
and 100) and populations (2,4 and 8). Tables 1, 2 and 3 show the empirical 
significance level related to the nominal significance level a = 0.05. 





Two Populations 


Four Populations 


Eight Populations 


SS 


CVM 


LR 


CVM 


LR 


CVM 


LR 


50 


0.0493 


0.0564 


0.0478 


0.0569 


0.0706 


0.072 


100 


0.0492 


0.0531 


0.0495 


0.0516 


0.0652 


0.0602 


150 


0.054 


0.0549 


0.0526 


0.0501 


0.0599 


0.0593 


200 


0.0478 


0.0493 


0.0536 


0.0527 


0.0595 


0.0544 


250 


0.051 


0.0524 


0.0526 


0.0528 


0.0549 


0.0563 


300 


0.0493 


0.0528 


0.0534 


0.0523 


0.0545 


0.0541 



Table 1 
Without Censoring. 





Two Populations 


Four Populations 


Eight Populations 


SS 


CVM 


LR 


CVM 


LR 


CVM 


LR 


50 


0.0506 


0.0513 


0.0457 


0.0588 


0.0535 


0.0682 


100 


0.0534 


0.0507 


0.0454 


0.0538 


0.0512 


0.0595 


150 


0.0526 


0.0504 


0.0497 


0.0558 


0.0535 


0.0581 


200 


0.0504 


0.0519 


0.0482 


0.0525 


0.0529 


0.0558 


250 


0.0475 


0.0487 


0.0504 


0.0525 


0.0456 


0.051 


300 


0.0517 


0.0547 


0.0482 


0.0499 


0.0505 


0.0539 



Table 2 

Censoring Variable: Poisson with A c = 100. 





Two Populations 


Four Populations 


Eight Populations 


SS 


CVM 


LR 


CVM 


LR 


CVM 


LR 


50 


0.0491 


0.0477 


0.0285 


0.0711 


0.0326 


0.1051 


100 


0.0483 


0.0484 


0.038 


0.0578 


0.0387 


0.072 


150 


0.048 


0.0485 


0.0457 


0.0531 


0.0446 


0.0575 


200 


0.0474 


0.0482 


0.043 


0.0552 


0.0451 


0.0635 


250 


0.0516 


0.0532 


0.0441 


0.0506 


0.0425 


0.0569 


300 


0.0481 


0.048 


0.0429 


0.0532 


0.0451 


0.0591 



Table 3 



Censoring Variable: Poisson with X c = 90. 

In all cases, the simulation shows that both test statistics have approx- 
imately the same behavior even for small and moderate sample sizes. Un- 
surprisingly, the number of populations involved in the analysis affects the 
convergence of the log-rank and Cramer-von Mises statistics. Moreover, as 
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the proportion of censored data increases, the number of populations be- 
comes more relevant for the convergence. 

As pointed out in the Introduction, the logrank test has little power 
for crossing hazard functions. In order to evaluate the performance of the 
Cramer-von Mises test developed in this paper, we analyze one classical 
discretely recorded data set with crossing intensities described in Klein 
and Moeschberger ([21], pp. 211). A clinical trial of chemotherapy against 
chemotherapy combined with radiotherapy in the treatment of locally unre- 
sectable gastric cancer was conducted by the Gastrointestinal Tumor Study 
Group. In this trial, forty-five patients were randomly divided into two 
groups and medically accompanied for eight years. We wish to test the null 
hypothesis Hq that the intensity functions of the two groups are the same 
by using the Cramer-von Mises statistics developed in this paper. By setting 
(u(n*,-) = 1) in (3.16), we obtain CVM = 0.0926 with P-value of 0.029, 
so the null hypothesis of no difference among intensity functions between 
the groups is rejected at the 5% level. The same hypothesis test based on 
the continuous versions of Renyi (P-value = 0.053) and Cramer-von Mises 
(P-value = 0.06) statistics do not reject Hq at the same 5% level. This 
result stresses the importance of modeling discrete data with methodolo- 
gies based on purely discrete distributions. In this particular case, classical 
methods based on continuous distributions fail to reject Hq while the dis- 
crete Cramer-von Mises test developed in this article successfully reject it 
at a given level of significance. 

7. Proof of Theorem 3.1. In this section, we provide the proof of the 
asymptotic result stated in Theorem 3.1. Proofs of Lemmas 7.1, 7.2, 7.3 and 
Proposition 7.1 are given in the Appendix 11. The following simple remark 
gives the expected probabilistic structure of £J^ q given by (3.1). 

Lemma 7.1. Assume that U satisfies assumption (Ml ) and {ff^ q (£); 1 < 
m < Nj,£ > l,n* G N J ,q G J} is a subset of L 2 (F). Then for each 
£ > 1, n* G N J and q G J , {(Cm : q(£)] 1 < fn < Nj} is a square-integrable 
martingale- difference w.r.t the filtration Q n (£) = (£); 1 < m < Nj}. 

Throughout this section, we assume that E|£™ j(? (£)| 2 < oo for every n* G 
N J , 1 < m < Nj, i > 1, and q G J. 

Lemma 7.2. Assume that a weighted process satisfies assumptions (Ml, 
M2, M3, M4)- Let i be a positive integer such that 6 p (i) > for every 
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p G J ■ Then for each I G {1, ••-,«} and q G J , 

(7.1) £ C*,W ->• ^(0, W) ^e«% a* -> oo. 

m=l 

T/ie asymptotic variance (/>g(£) is the limit in probability of 0^ n *(£) as n* — >• 

00 /or eac/i £ € {1, . . . , i}. 

Lemma 7.3. Assume that a weighted process U satisfies assumptions 
(Ml, M2, M3, M4). Let i be a positive integer such that 9 p (i) > for 
every p G J . Then for each q G J , the random variables {X^m=i 1 — 

1 < i} are asymptotically independent and 

i Nj i 
f=l m=l £=1 

Proposition 7.1. Assume that a weighted process U satisfies assump- 
tions (M1-M2-M3-M4) and (HI). Let i be a positive integer such that 
6 p (i) > for every p G S- Then for each £ G {1, . . . , i} 

weakly as n* oo, where 

(7.2) vec(Q(n*,£)\ ^vec(^Q(£)\ 

in probability as n* — > oo /or eac/i £ G {1, . . . , i}. In particular, 

i 

(7.3) E f "* W Ar (°' r (*)) wea ^ as ™* ~* °°- 
Proof of Theorem 3.1. 

Throughout this proof C is a constant which may defer from line to line. At 
first, we assume that d l = 1, d w = oo and 6> p (i) > for every p G J and posi- 
tive integer i > 1. By observing the identity (11.4) together with (M3'-M4') 
we readily see that £^ g (£) is square-integrable for every q G JT, £ > 1, n* G 
and m = 1 . . . , iVj so we are able to apply Proposition 7.1 accordingly. 
Let {W(i) = (Wi(i), Wj(i));i > 1} be the Revalued F-martingale with 
independent increments given in the proof of Proposition 7.1. We claim that 



2b 



(a) r(oo) = YleLi Q(£) i s a well-defined nonnegative self-adjoint operator 
on R J . 

(b) The weak convergence holds X^fci £ n W — ^ -^(0) r(oo)) as n* — > oo. 

Let us check (a). For a given a G we know that (W(i),a)^j has Gaus- 
sian law N(0, X]^=i(Q(^) a ) a )) f° r eacn « > 1- The definition of the co- 
variance operator Q in (3.6) and assumptions (M3'-M4') and (HI') yield 
su Pi>i X^=i IIQCOII < 00 • in particular, the following estimate holds 



a, a) 



< 



£=1 



EE 

l=\ k=i 



4 



E 



a 



k,qi 



(0 



'/i 



(£)+2 



(<7: 



E 



oo 



£=1 l<r<fc<J \qieA(fc,r) gi^fc <j 2 ^r- / 

for every i > 1 and a G M J . Hence, T(oo) := £Xi <W) 

converges absolutely 

on the space of matrices and it is the self-adjoint non-negative operator 
associated to the quadratic form YllbL\{Q(p) a i a )u J '■> a ^ ■ Now let us 
check (b). From Proposition 7.1 we know that W(i) ~ N(0,T(i)) and hence 
the previous argument allows us to define the iV(0, r(oo)) -Gaussian variable 
W'(oo) := linii-j.oo W(i) (weak sense). 

To shorten notation, we set W n * (i) := ELl > l,n* G N J . By us- 

ing the same argument as in (11.12) one can easily check that {X^=i ^ (£); i > 
1} is an F-martingale-difference so that {(VP™ (i),a) K j;i > 1} is an F- 
martingale for every a G M^. For each i > 1, Proposition 7.1 yields W n (i) —> 
W(i) ~ N(0,T(i)) weakly as n* — > oo. We now claim that the weak limit 
W n (oo) := linij^oo W n (i) exists for each n* G N J . In fact, for a given 
a G and n* G N , a straightforward but lengthy calculation shows that 
the quadratic variation of the martingale (W n (•), a) R j at a given point i > 1 
can be estimated as follows 



<=1 £=1 fc=l W^<? 



v K>v)l 2 v |cC>v)e%>v)I , 

31^9 (9l,92)eA, 
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E 2 E W E 91 vim + 

1=1 \<r<k<J \qi^k y 1 

\U^ n {n\t)UZi(n\l)\ \U%(n*J)U%rJ n *J) 

2.^ Vnu(P\ + 2_> 



V n "(£) ^ V n -(£) 

Therefore, we may use assumptions (M3'-M4 / -H1 / ) to ensure that for each 
a£B J 



(7.4) sup EV|A(VF n >),a)| 2 < oo 

i>l,n*eN J e=l 

and hence the Doob maximal inequality jointly with the Martingale conver- 
gence theorem yield linii_ i , 00 (W/ m (i),a) K ./ in probability for every a E M J 
and n* E N" 7 . This fact together with Cramer- Wold allow us to conclude 
that W n *(i) -> YT=\ weakly on R J as i -> oo for each n* E N J . At 

this point, we know that 



(7.5) lim lim W n \i) = W{oo). 

i— >oo n*— too 

Fix a E M J and denote VF n *(oo) = Y£iC*(fyn* G N J . Since {(VF n *(i), a) R j; 1 < 
i < oo} is a closed martingale then we can estimate the L 2 -norm from the 
quadratic variation as follows. For a given e > 0, the following estimate 
holds for each n* E N J and i > 1 as follows 

F{\(W n \oc)-W n *(i),a) RJ \>e) < 4jE|<(vK(-), a) RJ >(oo) -«#"*(■), o) RJ )(i) 

1 oo 

= A«PK(-),a) RJ )(*) 

£=i+l 
1 oo 

(7.6) = -^E ^ l(^ n *W- W n *(£-1),< 

£ £=i+l 

Assumptions (M3'-M4'-H1 / ) (in particular (7.4)) then imply that 
(7.7) lim UmsupPf |(W n *(oo) - W n \i), a) RJ I > e) = 0. 



1 2 

i A) IB J 
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From [[7], Theorem 3.2 page 28] and Cramer- Wold, the convergence (7.7) 
allows us to exchange the limits in (7.5) and therefore W n (oo) — > W{co) 
weakly as n* — > oo, thus we conclude (b). In particular, we have shown 

(7.8) W n \d u ) - W n \d l - 1) -> N(0,T(d u ) - T(d l - 1)) 

weakly as n* — > oo. We now proceed by using the above arguments on the 
set {d l , . . . , d u } and this time we have to play with the stopping times d l n * 
and d%*. We fix a € M. J and we write 

{W n \dl*) - W n \d l n *),a) RJ = (W n \d u ) - W n \d l ),a) RJ + 

(W n *(d^) - W n *(d u ),a) KJ + (W n \d l ) - W n \dl),a) RJ . 

By considering c"(n*) = d u A and (^(n*) = d u V d%*, we may follow the 
same steps given in (7.4) and (7.6) to show that for every e > 

F[\(W n \d u )-W n \d u n ,),a) RJ \>e] < £ \A(W n * (I), a) RJ \ 2 -+ 

f=c«(n*)+l 

as n* — y oo. The same argument also applies to d A d l n * and d V d^* and 
therefore Cramer- Wold and (7.8) allow us to conclude that (3.8) holds. 

The limit (3.9) when d u < oo is a direct consequence of Proposition 7.1 
and in particular (7.2). So we only need to prove the case d u = oo, i.e., 

vec\T(n*, d%* )J — > uec^r(oo)^ in probability as n* — > oo. In fact, one has 
to check the following convergence in probability 

a n* OO 

(7.9) ->Z>?(*);9 6 



(7.10) J^^ n *(A;,r,£) -»■ ip(k, r,£); 1 < r < k < J. 

£=1 €=1 

as n* — >■ oo. In the sequel, to shorten notation we write rf(l) = h p (£)[l — 
h p (l)\ for p G J, I > 1 and we proceed componentwise. For a given qi ^ q 
in i7, we shall write 
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K \Un q q An* 



E 

l=i 



oo 



i=i y ' 



E <*£,,ww 



(7.11) := T x {n*)+T*{n*)+T^n*). 

Assumption (M3') and the fact that sup^ >1 < 1 for every p € J yield 

TT=i afiqiiW 1 ^) < oo so that T 3 (n*) ^0 in probability as n* — > oo. Also 
from (M3') we have {^Xi I^K.^P^W)" 1 ;"* G is bounded 
in probability and therefore Remark 2.3 implies 

00 It/™' (n* £)\ 2 
mn*)\ < sup |rf« (*) - ^ (£)| £ ^ ° 

in probability as n* — > oo. The assertion that T2(n*) — > is a direct conse- 
quence of (M3'). By using exactly the same above argument for the other 

terms in the difference X^=i Pq,n*(£) ~~ an d X^=i r > ^) ~~ 

Sfci VK^> r i ^) together with the correspondent assumptions (M4'-H1'), we 
arrive at (7.9) and (7.10). This allows us to conclude the proof. 

8. Proof of Theorem 3.2. We start with the simplest case d u < oo 
and to shorten notation we denote {i) the g-th coordinate of the vector 
W n (i) introduced in the proof of Theorem 3.1. Throughout this proof, any 
element for p < oo is identified as an element of £ 2 in the obvious way, 

C is a constant which may defer from line to line and we set k(p, q) = q—p+1 
for any 1 < p < q < oo. We can write 

GET(n*,<f>,p,q) = {M p W n \p),...,M q W n \q)) 

where Mj := diag (4>i(j),... ,</>j(j)); j > 1. A direct application of The- 
orem 3.1 yields MiW n * (i) iV(0, MiT(i)Mi) weakly as n* ^ oo for each 
i > d l and more importantly, for every a £ IR- 7 ^ 1 ' 1 ) we have 



(GET(n\ <p, d l ,i),a) uJHdKi) -> n(o, (y(d l ,i)a, a) Rjk(dl ^ 
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weakly as n* — > oo. This shows that for each i > d 1 



(8.1) GET(n\(/),d l ,i) N(0,y(d l ,i)) 
and 

k(d l ,i) j 

(8.2) \\GET(n*,<j>,d l ,i)\\ 2 e ^ £ 

s=l q=l 

weakly as n* — > oo, where {A S(J ; 1 < s < k(d l ,i),q = 1,...,J} are the 
eigenvalues of y(d l ,i) and {x%', 1 < s < k(d l ,i),q = 1, . . . , J} is an i.i.d 
subset of chi-squared random variables with one degree of freedom. Next, 
we have to play with the estimators c/) n *, d l n * and d™*. Triangle inequality 
yields 

\\GET(n\(b,d l ,d u ) -GET(n*J n *,d l n *,dt*)\\e < 



\\GET{n\ 4> n *,d\ d u ) - GET{n\ $,d\d u )\\p + 



\\GET(n\ 4> n * , d\d u ) - GET(n\ $ n *,d l n * , = : 
T 1 (n*)+T 2 (n*). 

For a given q G J and n* G N" 7 , {W™ ;i > 1} is an F-martingale and 
therefore Doob's maximal inequality and assumptions (M3' ,M4') yield 

E sup \Wf(i)\ 2 < CE(Wf) (oo) < 



(8.3) 2e£ Y, WZ{n *f5l> {n ' M <C^n^K>. 

Estimate (8.3) implies in particular that {sup i>:l |W™*(i)| 2 ; n* G N J } is 
bounded in probability for each q € J. Therefore, Lemma 7.2 yields 
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k(d!,d u ) J J 
(8.4) Ti(n*) 2 < El^( s )-^( s )| 2 E su Pl W r«| 2 ^ 

8=1 q=l q=l *- 1 

in probability as n* — > oo. By the very definition, 
(8.5) 

T 2 \ n ) '■= \ Ulrll ™*j« n t 

So T2(n*) —7- in probability as n* — >• oo. Summing up the above estimates 
we conclude (3.14) and (3.15) when d u < oo. Let us now treat the case 
d u = oo. At first, we notice that y(d l ,d u ) is a nuclear operator. For a 
given i > d l , let {X sq ; 1 < s < k(d l ,i),q = 1, . . . , J} be the eigenvalues of 
y{d\ i) and let {x%; l<s< k(d l ,i),q = 1, . . . , J} be an i.i.d subset of chi- 
squared random variables with one degree of freedom. Convergence (8.2), 
properties (M3 / -M4 / ) and estimate (8.3) yield 

(8.6) 

Jk(d l ,i) Jk{d l ,i) oo J 

V A S = E V X a ^<^ME\\GET(n*,<f>J,i)\\%<cf]y i <^{£) ) 

8=1 8=1 £>d l 9=1 

for every i > d l and hence (8.6) yields Tr (y{d l ,d u )) = Y^T=i X s < oo. Let 
us now check tightness of the family {GET(n*, d l ,d u );n* G N J }. From (8.3) 
and (M3'-M4'), we have 

J oo J 



(8.7) E\\GET(n*, d l ,d u )\\j 2 < ^Esup \Wf {s)\ 2 x ]T^|<^( S )| 2 < C 

q=l S - X s=l q=l 

for every n* G N J . In particular, (8.3) yields 

oo J J oo J 

(8^ E£ j>,(s)W?>)| 2 < ^Esup|Wf( S )| 2 x £j> fl (r)| s 

n*£NJ s=JVg= l g=i «>1 r=JV«/=l 

J oo 

< C^^|^(r)| 2 ^0, 
5=1 r=J V 
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as N -)• oo. Hence, (M3'-M4'), (8.7) and (8.8) allow us to conclude the 
relatively weak compactness which together with the weak convergence of 
the finite-dimensional projections (8.1) imply 

(8.9) GET(n\ct),d\d u ) -> N(0,y(d l ,d u )) 

weakly as n* — > oo. It remains to play with the estimators, but this is 
a straightforward consequence of the previous arguments. In fact, triangle 
inequality yields 

\\\GET(n*,<(>,d l , ^11% - \\GET(n*,4> n *,d l n *,c%*)\\%\ < 
\\\GET(n\^,d l ,d u )\\% - \\GET(n\^d l ,d u )\\%\ + 

\\\GET(n\4> n ,,d l ,d u )\\j 2 - \\GET(n*J n *,d l n *,d%*)\\l\ =: 
r 3 (n*)+T 4 (n*). 

The same arguments given in (7.9) and (7.11) allows us to get Yl^Li Ylq=i l^g, 
4>1{s)\ — > in probability as n* — > oo and since sup^ \ W™ (i)\ 2 is bounded 
in probability, we can safely conclude 

oo J J 

(8.10) T 3 (n*) < E 8U Pl< 

8=1 q=l q=l *- 1 

in probability as n* — > oo. By the same reason, 

oo J 

r 4 K)< £ ^|4, n ,( s )wf( s )| 2 ^o 

s=k(d l ni ,,d% ir )+l 9=1 

in probability as n* — > oo. Convergence (8.9) jointly with Ts(n*) +T±{n*) — > 
in probability as n* — > oo allow us to conclude the proof. 

9. Proof of Proposition 3.1. At first, we check (M1-M4) and (HI). 

Let us fix a positive integer £ > 1. Assumption (Ml) is obvious. If 5 > 
then we shall use the sample growth condition and the definition of u to get 
the following estimate 

2+5 



Un q qi (n\i) 
q V n «{i) 



— > as n* — > oo, 
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for any q ^ q\ in J . This shows that assumption (M2) is satisfied. For a 
given q / q± in J and q 2 = q we have 



(9.1) 



y n ?i (£) 

u(n*,£) }„> 



'n q V n «(£) 



n n 



For q 2 = qi, we have a similar expression. Now, if q 2 £ J (q 2 ^ q, 1i ^ Ql) 
then we shall write 



(9.2) 



UZ(n*J)U%(n*,l) 



->u(n*,l)\ 2Vn92{i) Vn " l{e) nq Vnq{£) 



V n *(£) 



V n *(£) V n *(£) n n q 



Identities (9.1) and (9.2) allow us to use again the sample growth condi- 
tion, the definition of u and the binomial property to get assumption (M3) 
and (M4). In fact, 



and 



P*(£)=u?{l) 



'■{£) b qi e^(£) 



b q 6 q {£). 



Let us now check (HI). If q\ ^ k G J^, then we shall write 



(n*,e)UI* (n*,£) 



V n "i{£) 1 v n V n *(t)V n *(£) n n qi 

For a given r ^ k in J and q\ ^ k we shall write 



(9.4) 



U^ n {n\£)U^ k {n\£) 
V n k{£) 



\u{n\£)\ 



,V nr (£) V nk (£) n qi V n "i {£) 
V n *{£) V n *(£)~k n qi 



Identities (9.3) and (9.4) allow us to use again the sample growth condition, 
the binomial property and the definition of u to get assumption (HI). In 
this case, 



(9.5) 



ri k a ?(i)=7« a ?(i) 



k. 



b k e k 



(0 



b r e r {£) 



I'll 



b qi e^(£). 



3G 



It remains to check assumptions (M3'-M4') and (Hi'). For the assumption 
(M3'), we notice that if we take qi = q, there exists a positive constant C 
such that 



(9.6) 



K>v)l 2 <C Y^W £>1 



V n i{£) 



Therefore, we have that 



lim sup E 

1=1 n * 

Obviously, 



^<cf;iimsupE^^<cf; 

l=\ n * n i 1=1 



< oo 



(9.7) 



Un q a (n*,e)\ 2 

■ ' n 1 +<,(£). 



V n 12 (£) 



V n "2 {£) 

and from (9.1) and the binomial property we actually have 

K>v)l 2 



(9.8) 



lim E 

n*— >oo 



V n "2 (£) 

for each £ > 1. For a given e > 



o 



lim ] 

n*— too 



> e 



(9.9) 



1 °° 

< lim -VE 

n*— >oo £ ^— ' 
^ oo 

= -V lim E 

£ — ' n*— >oo 

" 1 



|tC>V)| 2 



V n "2 (£) 

K>V)I 2 



V n «2 {£) 



0. 



The justification of the limit into the series in (9.9) is due to (9.6) and (9. 
which gives a constant C > such that 



E K>v)p 

V n i2 (£) 



V n i (£) 

< CE ±-L = C9 q {£);£> 1, 



and 
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for every n* £ N* 7 where £ ^(N) (by assumption X q is integrable). 

We can apply the same arguments to check assumptions (M4') and (HI'). 
This concludes the proof. 

10. Proof of Theorem 5.1. The proof is an almost direct consequence 
of Theorem 3.2. Convergence (5.4) is consequence of Theorem 3.2 and the 
only statement which has to be detailed is convergence (5.5) when d u = oo. 
We take d™+ given by (4.2) and without any loss of generality and to simplify 
notation, we set d l n * = 1. The arguments for general d l n * follow easily from 
this case. Let us define 

A(n*,N) := {j> (l,JV A <*) is non-negative} 

and we notice that A(n*) = n^ =1 A(n*, N). We claim that lim^^^ F(A(n*)) = 
1. In fact, a basic inequality among compact operators (see e.g [14]) yields 
the following a.s estimate 

(10.1) \X sq -X sq \ < ||3>o(l,<* AN)-y (l,d&. AN)\\ 

for every q = 1, . . . , J — 1 and 1 < s < d^ A N. Here || • || stands the 
strong norm over the space of bounded operators in I 2 . Since inf{A S(J ; 1 < 
s < N, 1 < q < J — 1} > 0, we may use (10.1) together with Lemma 7.2 and 
Theorem 3.1 to conclude that 

k 

lim p( H A(n*,N)) = 1 for each k > 1. 
N=l 

Hence, we do have 

k 

(10.2) lim F(A(n*)) = lim lim pf C] A(n*, N)) 

n*— >oo V / n*— >oo k— ¥00 V ' / 

N=l 

k 

= lim lim pf C] A(n*,N)) = 1. 

N— >oo n*— >oo V ' / 
N=l 

For a given i.i.d sequence of real-valued Gaussian variables A^(0, 1) 

and positive integer > 1, let us define the following sequences of £ 2 -valued 
random variables 
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X(sq) := \Z\ sq X sg ; q = 1, . . . , J - 1, s > 1; 

g = 1, . . . , J - 1; 1 < s < N 

s > N; 



; g = l,...,J-l;l<s<<& 

g = l,...,J-l;l<*<d^*AJV 
s > A iV. 

Convergence (10.2) and the inequality (10.1) yield 

\\X$-X N \$^0 

in probability as n* — > oo for each N > 1. Of course, liniTY—^QQ ^ — in 
probability in £ 2 . The strategy is to prove that for a given e > 



X N (sq):=\ ^i X 'i j 



(10.3) lim limsupPjUi'J - X n *\\p > e\ = 0. 

Under (10.3), we may exchange the iterated weak ^ 2 -limits lim at lim n * X^l = 
lim^* limiv X£l which allow us to conclude (5.5), i.e., ||AV|| 2 2 — > \\X\\g 2 
weakly as n* — > oo. By the very definition, 

J-i <%* 

(10.4) \\X$ - X n 4] 2 = E A sg Xs 9 lA(n*); i < N < 

q=l s=N+l 

and lo(l, d^*)1^4( n *) is non-negative a.s for every n* G N J . In the sequel, 
we assume that A are enumerated with algebraic multiplicities taked into 
account. By the Lidskii trace theorem 

d l* J-i 

s=l <J=1 

a.s for each n* G N^. In particular, from the fact that Yq(1, c^*)-"-A(n*) ls 
non-negative a.s we get the following bound [see e.g Corollary 3. 7. p. 56 in [14]] 
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J-l 



^A S9 a A(n *) < Yl Tr M Q {n\j)T {n\j)M Q (n\j)\ A{n * ) 
s=N+l g=l j=N+l 



(10.5) 



d n* J-l J- 3 \ 

E E 1 E^U*^) f X ^,»*0')lA(n*) O-*. 



for n* G N" 7 . We know there exists C > (which only depends on (b p )p =1 , 
see (9.6)) such that 



C2 n * 7_1 f \ <f"* T_1 f o 

12 /-N..2 i, / ^ V V ^ W 



E E E*Uo x^^M-) < c E E E 



j=N+l g=l 1=1 

(10.6) 



j=iV+l g=l I. 1=1 
X — Xjq*A{ n *) 

I in 



11,; 



a.s for every n* € N . In particular, we may assume that {xjp'i 3 ^ -W+l) 1 < 
p < J — 1} are independent of V nq for every g G J to get 



(10.7) 



j=7V+l g=l £=1 



9=1 j=N+l 1=1 q q 



By taking advantage of the independence of the samples, a straightforward 
calculation yields 



(10.8) EV n «(£)V n «(j) = n q ei(j)(l-0i(e))+n q 0i(£)n q e«(j) 



for every n q > 1, q €L J and 1 < £ < j < oo. Hence, from (10.8) we obtain 
the following bound 



J— 1 oo j 



J— 1 oo j 



g=l j=Ar+i £=i 



9=1 j=Ar+l £=1 



q U) 



+ q (£)0 q U) 
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J-1 OO „„, ... J-1 oo 3 

q=l j=jV+l 1 q=l j=N+l 1=1 



J— 1 -. oo J— 1 oo 



(io.9) E- E W) + E E ^(j)ex* 



77- 

q=l 1 j=N+l q=l j=N+l 



where ^JLn+i J^ 9 (j) < 00 f° r every 7Y > 1 due to the integrability assump- 
tion X q G L 2 (P); g = 1,. .. , J-1. Summing up the above steps (10.6), (10.7) 
and (10.9), for a given e > the following estimate holds 

1 dl * J ~K 

limsupP{||## -X n *\\% > < hmsup-E E^*? 1 ^*) 



< llimsupE E EfcW xI^O') 

e j=JV+l«=l U=l J 

J— 1 00 

< — limsupV]— V] j6 q (j) 



^ n*— >co _w 7^g 



<j=l « j=JV+l 



J— 1 00 J— 1 00 

(10.10) + E E 9 '(j') m, = E E ^(i) E ^- 

q=l j=N+l q=l j=N+l 



Finally, from (10.10) and the integrability of X q we may conclude (10.3). 

|2 _v II V||2 
\p -r \\p 



This shows \\X n * ||? 2 — >■ ||^||/2 weakly as n* — > 00 which allows us to conclude 



the proof. 

11. Appendix. In this appendix, we provide the proofs of Lemmas 7.1, 7.2, 7.3 
and Proposition 7.1. 

11.1. Proof of Lemma 7.1. 

Proof. Let us fix t > 1, n* G N* 7 and k G '. By construction, we notice 
that 
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E[A<(£)|di< 



E[AR: 



(n.i) 

Therefore E[Ar' r ^(^)|^*_ 1 ( 
us to conclude the proof. 



11.2. Proof of Lemma 7.2. 



V*{£)h*{l) 

EiAN^QCM l<m<n k . 

for 1 < m < nk. Assumption (Ml) allows 

□ 



Proof. At first, we notice that at the category i, the candidate variance 
<pg(£) is well-defined for every I G {1, . . . ,i} and q G J so let us fix such 
and a population In order to apply the classical martingale central 
limit theorem, we begin by verifying the Lindeberg condition. Indeed, it is 
sufficient to establish the conditional Liapunov condition 



Nj 



\2+5\nn 



m=l 



in probability as n* — > oo, 



for some 5 > 0. By the very definition of Rm and Doob-Meyer decomposition 
we have |AY^(£)| 2+<5 < 2 2+<5 a.s and hence we may use assumptions (Ml) 
and (M2) to find a constant C which only depends on J and 5 such that 



N, 



E E 0c 



m=l 



\2+5 Inn 



< 



n, 



V n «{l) 



2+5 



n 



—7-0 as n* — > oo. 
In order to shorten notation, let us define 



2+5 



(11.2) 
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(11.3) 



A m,n*W^ 1 ) ■ yn qi tg\ 



The quadratic variation of the martingale Sm=iC,oW a * the P om t Nj 
can be written as 



N, 



v-)l l^m-l(^) 



(ii-4) E E [ |C 

m=l 

TV, 

E E Ep^Cft,/) - ^ B *(«i,/)| 2 |SSUiW 



"1=1 (ji^g 



+ 



2 E E E[(7r^(Q lj i)-A^( ? i^))(7T^(g 2) £)-A^(g 2 ,i)) 
m=1 (gi,92)eA 8 

= :r 1 (n*,£)+T 2 K^). 
Assumption (Ml), the independence of the random sample and (11.1) yield 



[11.5) 



= e '^i!*:?' 2 ^ 



<?i^<? 



V n n (£) 



91t^<? 

Assumption (M3) then yields 



y™ 9 (£) 



(11.6) 

91^9 917^9 

in probability. The arguments used for the first term can also be applied to 
T2(n*,£) and then we shall write 



(11.7) T 2 (n*,f) = 2 



^ «i(-*.o«i(-"^ Mtl _ tfM] . 



(<?1.92)GAq 



v^(0 
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Assumption (M4) then yields 



(11.8) limT 2 (nV) = 2 £ fP {C)h« (£)[l - h*(£)}. 

n*— >oo z — ' 

(gi,g2)eAj 

in probability. Summing up (11.6) and (11.8) we conclude that 



Nj r i 
(11.9) E IC* 9 (^)| 2 |^m-i W -> <t%( £ ) in probability as n* -> oo. 

771=1 

Summing up the above steps, the martingale central limit theorem ap- 
plied to {£m i(? (^);l < Tn < Nj} ensures the weak convergence (7.1). The 

convergence lim n *_Kx> (K n * (i) = 4> 2 q {£) m probability is a consequence of 
relations (11.5) and (11.7) combined with Remark 2.3. □ 

11.3. Proof of Lemma 7.3. 

Proof. We fix q G J and a category i such that mini< p <j{8 p (i)} > 0. 
Let us denote by Z q (-) the weak limit of (7.1) in Lemma 7.2 



Nj 

(11.10) Z q (i):= lim l<i<i- 

n*— >oo * — • " 

m=l 

We also set 



(11.11) 

By the very definition 

i 

5^E[Z 9 (*)|Ji_l] = Wg(i " 1) +E[Zg(i)|Ji_l]. 

Un q (n*,-) 

Since ; 4 is F-predictable for any q\ ^ q and q2 £ J, we may use 

the martingale property of {y™ p , p € J7} and the definition of F to get 
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TJ n l ( n * i) 

(n.12) nzM^i] = j^E yia] L mY»*®\7ti} 

n*-s>oo ^ V n ii (l) L V n 1 U 

91^9 

= (weak sense). 

This shows that W q is an F-martingale. At this point, from Lemma 7.2 we 
only need to check that W q has independent increments. For this, we claim 
that (Wq) is a deterministic process. In order to shorten notation, let us 
define 

JJ n i ( n * £) 

nl( qi ,£):= J AY n «(£); Ql + q, 

for 1 < I < i. With this notation at hand, we have 

(w q )(i) = 



iim J2 E nwunj) - xuqiJ)\ 2 \^-i] 

n*— >oo — ' t — » 

1=1 q^q 



2 i im E E E [K*(9i>-0-^*(9i^)K*(92»^)-^*(92^)l^- 



=1 (9l,92)6A 



= : lim V y)Ti( g i^,n*)+2 lim V V T 2 ( qi ,q 2 ,£,n*). 

^=1 917^9 t=l (q 1 ,q 2 )£A q 

The martingale property and the independence of the random sample 
yield 

(11.13) 

E[AY n "{£)AY n n Wl-F^i] = E[AY n «(£)\ T^} x E[AY n 'i (£)\T^\] = 0, 
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for every qi ^ q. Moreover, 

E[|Ay n «(^)| 2 |^-i] = W nqi ^W^){l-h qi {i))\F^\] 

(11.14) = V 11 ^ (£)h qi [1 - h qi (£)} Vgi G J and \ < I <\. 

From equations (11.13) and (11.14), we may write 



\U nq (n* £)\ 2 

\TJ n i ( n * £)\2 

(11.15) + yn^ h q (£)[l-h q {£)\. 

The same arguments for Ti(qi,£,n*) may be applied for the second term 
and in this case the crossing terms T 2 {qi, q2,£,n*) may be written as 



T 2 ( qi ,q 2 ,£,n*) = n<*{Qi^<*iftiM^l-\] 

(n.16) = ^ {n *^ {n *^ h*m-h*m- 

Summing up equations (11.15) and (11.16) and using assumptions (M3) 
and (M4), we do have 



1=1 qi^q 

i 

+ 2 E E P q qi , q2 (t)h q m-h q (£)\- 

This shows that Z g has independent increments. Lemma 7.2 allows us to 
conclude the proof. □ 

11.4. Proof of Proposition 7.1. 

Proof. Let us fix 1 < £ < i and a weighted process U which satisfies 
assumptions (M1-M4) and (HI). In view of a Cramer- Wold argument, 



4G 



we fix an arbitrary a = (ai, . . . , aj) € M and to shorten notation we write 
£m ( £ ) : = Eg=i C* g (^K- From Lemma 7.1, we know that is a G n * (£)- 

martingale difference for each n*. Let us now check the conditions for the 
Central Limit theorem under martingale dependence. For any 5 > 0, there 
exists a constant C which only depends on J and 5 such that 



J>[|C*WI 2+5 l^-iW] < 



J iVj 

C E h 2+5 E e oc>)I 2+5 ic-iW] 

fc=l m=l 
J Nj 

c H m 2+5 E E [ic; fc »i 2+5 ic-iW] 

fc=l m=l 



(11.17) in probability as n* — > oo. 

The convergence (11.17) is due to assumptions (Ml) and (M2). Now let 
us consider the predictable quadratic variation of J2 m =i £m CO a ^ the point 
Nj as follows 



Nj J Nj 

2 e[i&*wi 2 = E h 2 E E 0c; fc wi 2 ic-iW] 

m=l fc=l m=l 

+ 2 ^ E^^KfeWCwidii 

l<r<fc< J m=l 

=: ri(^n*)+r 2 (^Ti*). 
Assumptions (M3-M4) and step (11.9) in Lemma 7.2 yield 



J 

(11.18) Ti(£,n*) -^E a ^W 

fc=i 

in probability as n* — >■ oo. It remains to investigate T2(£,n*). Let us fix 
1 < m < iVj, and a pair r 7^ in J . We shall use the notation introduced 
in (11.2) and (11.3) to write 
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Let us fix any q\ ^ k and q2 7^ r. Assumption (Ml), the independence 
of the random sample and (11.1) yield 



(11.19) ElCfe.^^te.QlCiW] = 0. 

Again we may invoke assumption (Ml), the independence of the random 
sample and the definition of Q n {£) to write the following relations 



(n*,£)U^ (n*,£) 

n4 t , n *(qi,t)K n ,n*(<h,e)\s£-iW] = oif^g 2 , 

= v*(i)[h k (e)(i-h k m 

(n*,£)U^ (n*,£) 

<"- 21 > - - 'Up ' ■»-« 
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n>kn*(<n>Q* r m,n*M\$k-lW] = ° if <?1 + T, 

= V%{£)[h r {£){l-h r {£))] 

(1L22) x w^w — qi=r - 

Summing up relations (11.19), (11.20), (11.21) and (11.22), we actually 
have 

(11.23) 

^ (n\£)U^ (n*,£) 

T 2 (t,n*) = 2 v Lu) — -f^m-f^m 

l<r<k<J qi^k ^ ' 

_ _ Up (n*, e)UZT(n*, £) . 

- 2 E Q ^E ylffl 
By making use of the assumption (HI), it follows that 

(11.24) lim T 2 (£,n*) = 2 V a r a k ip(k,r,£) 

n*— >oo ^— 

l<r<fc<J 

in probability. Therefore, an application of the central limit theorem yields 

Nj J 
m=l fc=l l<r<fc<J 

weakly as ra* — >• oo and hence Lemma 7.3 yields 

£">*(£) -> Z(£) weakly as n* ^ oo 

where Z(£) = (Zi (£),... , Zj(£)) (see (11.10)) has the Gaussian law N(0, Q{£)) 
for 1 < £ < i. Relations (11.18), (11.5), (11.7), (11.23) and Remark 2.3 allow 
us to conclude that 
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in probability as n* — > oo for 1 < i < i. It remains to check (7.3) but for 
this, we may apply the same arguments of Lemma 7.3. By using the notation 
introduced in (11.11), let us consider 

w(i) = (Wi(i),...,Wj(0). 

By repeating the same arguments as in the proof of Lemma 7.3, it is 
straightforward to check that W is an F-vector martingale with independent 
increments on the subset {1, . . . , i}. That is, Z{j) and Z(m) are independent 
M. "'-valued random variables for every m ^ j in Under these 

conditions we may conclude convergence (7.3). □ 
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